Efficient re-indexing of automatically annotated image collections using keyword combination
Abstract
This paper presents a framework for improving the image index obtained by automated image annotation. Within this framework, keyword combination is used for fast image re-indexing based on the initial automated annotations. The framework aims to tackle the challenges of limited vocabulary size and low annotation accuracy resulting from differences between training and test collections, and it is useful when these two problems were not anticipated at the time of annotation. We show that, given example images from the automatically annotated collection, it is often possible to find multiple-keyword queries that retrieve new image concepts not present in the training vocabulary and that improve retrieval results for concepts already present. We demonstrate that this can be done at very small computational cost and with an acceptable performance tradeoff compared to traditional annotation models. We present a simple, robust, and computationally efficient approach for finding an appropriate set of keywords for a given target concept. We report results on the TRECVID 2005, Getty Image Archive, and Web image datasets, the last two of which were specifically constructed to support realistic retrieval scenarios.
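The abstract does not spell out how the keyword set is chosen, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes a greedy forward selection that scores each candidate keyword set by average precision on a handful of example images, and it scores an image by the mean annotation confidence of the selected keywords. The index layout, the function names, and the toy data (a "beach" concept missing from the vocabulary) are all hypothetical.

```python
from typing import Dict, List, Set

# Hypothetical index layout: image id -> {keyword: annotation confidence}.
Index = Dict[str, Dict[str, float]]


def combined_score(annotations: Dict[str, float], keywords: List[str]) -> float:
    """Assumed combination rule: mean confidence over the selected keywords."""
    if not keywords:
        return 0.0
    return sum(annotations.get(k, 0.0) for k in keywords) / len(keywords)


def average_precision(index: Index, keywords: List[str], positives: Set[str]) -> float:
    """Rank all images by combined score and compute AP against the examples."""
    ranked = sorted(index, key=lambda img: (-combined_score(index[img], keywords), img))
    hits, total = 0, 0.0
    for rank, img in enumerate(ranked, start=1):
        if img in positives:
            hits += 1
            total += hits / rank
    return total / len(positives) if positives else 0.0


def select_keywords(index: Index, positives: Set[str], max_keywords: int = 3) -> List[str]:
    """Greedily add the keyword that most improves AP; stop when nothing helps."""
    vocabulary = sorted({k for ann in index.values() for k in ann})
    chosen: List[str] = []
    best = 0.0
    while len(chosen) < max_keywords:
        candidates = [
            (average_precision(index, chosen + [k], positives), k)
            for k in vocabulary
            if k not in chosen
        ]
        if not candidates:
            break
        score, keyword = max(candidates, key=lambda c: c[0])
        if score <= best:
            break
        chosen.append(keyword)
        best = score
    return chosen


# Toy collection: "beach" is not in the annotation vocabulary.
toy_index: Index = {
    "img1": {"sand": 0.9, "water": 0.8, "sky": 0.7},      # beach
    "img2": {"sand": 0.7, "water": 0.9, "sky": 0.6},      # beach
    "img3": {"water": 0.9, "sky": 0.8, "building": 0.1},  # lake
    "img4": {"sand": 0.8, "sky": 0.9, "building": 0.2},   # desert
    "img5": {"building": 0.9, "sky": 0.6},                # city
}
beach_examples = {"img1", "img2"}

query = select_keywords(toy_index, beach_examples)
print(query, average_precision(toy_index, query, beach_examples))
```

In this toy setup no single keyword ranks both beach examples first, while a two-keyword combination does, which is the effect the abstract describes. Only the initial annotation scores are reused, so no visual features need to be recomputed.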
Similar resources
An Efficient Query Mining Framework Using Spatial Hidden Markov Models for Automatic Annotation of Images
A novel method for automatic annotation of images is used with keywords from a generic vocabulary of concepts or objects combined with annotation-based retrieval of images. This can be done by using a spatial hidden Markov model, in which states represent concepts. The parameters of this model are estimated from a set of manually annotated training images. An image in a large test collection is t...
Image indexing and retrieval using automated annotation
Searching digital information archives on the Internet and elsewhere has become a significant part of our daily lives. Amongst the rapidly growing body of information there are a vast number of digital images. The task of automated image retrieval is complicated by the fact that many images do not have adequate textual descriptions. Retrieval of images through analysis of their visual content i...
Automatic keyword extraction using Latent Dirichlet Allocation topic modeling: Similarity with golden standard and users' evaluation
Purpose: This study investigates the automatic keyword extraction from the table of contents of Persian e-books in the field of science using LDA topic modeling, evaluating their similarity with golden standard, and users' viewpoints of the model keywords. Methodology: This is a mixed text-mining research in which LDA topic modeling is used to extract keywords from the table of contents of sci...
Content Based Image Re-ranking using Indexing Methods
With the increase in the number of digital images, efficient image retrieval has become an important research topic. Traditional image retrieval methods used metadata associated with images, commonly known as keywords. These methods powered many World Wide Web search engines and achieved a reasonable level of accuracy. A novel image re-ranking framework is proposed, which offline learns differe...
Text Based Approaches for Content Based Image Retrieval in a P2P Network
The tremendous growth of digital multimedia content on the web requires scalable, efficient, and effective information retrieval mechanisms. Handling such large collections of data in a centralized way requires costly high-bandwidth connectivity and powerful servers. This establishes the need for distributed architectures, such as peer-to-peer systems, that allow sharing of data management and s...